Abstract

Let $(X_i, i \geq 1)$ be a sequence of i.i.d. random variables with values in $[0,1]$, and $f$ be
a function such that $\mathbb{E}(f(X_1)^2) < +\infty$. We show a functional central limit theorem for the
process $t \mapsto \sum_{i=1}^{n} f(X_i)\, 1_{X_i \leq t}$.

1 Introduction

Let $(X_1, X_2, \dots)$ be a sequence of i.i.d. random variables (r.v.) with values in $[0,1]$, having
distribution $\mu$, distribution function $F$, and defined on a common probability space $(\Omega, \mathcal{A}, \mathbb{P})$
on which the expectation operator is denoted $\mathbb{E}$. In this paper we are interested in proving a
functional limit theorem for the sequence of processes $(Z_n, n \geq 1)$ defined by

$$Z_n(t) := \frac{1}{n} \sum_{i=1}^{n} f(X_i)\, 1_{X_i \leq t}, \qquad t \in [0,1], \tag{1}$$


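where $f : [0,1] \to \mathbb{R}$ is a measurable function. Let $(\hat{X}_i, 1 \leq i \leq n)$ be the sequence $(X_i, 1 \leq i \leq n)$
sorted in increasing order, and for any $t \in [0,1]$, denote by

$$N_n(t) = \#\{i : 1 \leq i \leq n,\ X_i \leq t\}$$

the number of $X_i$'s smaller than or equal to $t$. Clearly, for $t \in [0,1]$,

$$Z_n(t) = \frac{1}{n} \sum_{i=1}^{N_n(t)} f(\hat{X}_i).$$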

Hence, $Z_n$ encodes the partial sums of functions of sorted i.i.d. r.v., as mentioned in the title of
this paper. In order to state a central limit theorem for $Z_n$, the condition $\mathrm{Var}(f(X_1)) < +\infty$ is
clearly needed, but it is not sufficient to control the fluctuations of $Z_n$ on all intervals. Standard
considerations about the binomial distribution imply that $N_n(t_2) - N_n(t_1)$ is quite concentrated
around $n(F(t_2) - F(t_1))$ (for $t_1 < t_2$). Conditionally on $(N_n(t_1), N_n(t_2)) = (n_1, n_2)$,

$$Z_n(t_2) - Z_n(t_1) \overset{(d)}{=} \frac{1}{n} \sum_{k=1}^{n_2 - n_1} f\big(X_{(t_1,t_2]}(k)\big) \tag{2}$$

where $\overset{(d)}{=}$ means "equals in distribution", and where $(X_{(t_1,t_2]}(k), 1 \leq k \leq n_2 - n_1)$ is a family of
i.i.d. r.v., whose common distribution is that of $X$ conditional on $X \in (t_1, t_2]$. Hence, to get a
functional central limit theorem for $Z_n$, the variances of these distributions need to be controlled.
The following hypothesis Hyp is designed for that purpose: 

Hyp: there exists an increasing function $T : [0,1] \to \mathbb{R}^+$ such that:

$x/T(x)$ is bounded,
$T(x)\ln(x) \to 0$ as $x \to 0$,

for every interval $I \subset [0,1]$, $\mathrm{Var}(f(X) \mid X \in I) \leq \dfrac{T(\mu(I))}{\mu(I)}$,

where $\mathrm{Var}(g(X) \mid X \in I)$ denotes the variance of $g(X)$ conditional on $X \in I$ (by
convention, we set $\mathbb{E}(g(X) \mid X \in I) = 0$ when $\mathbb{P}(X \in I) = 0$).

When $f$ is bounded by $\gamma$ on $[0,1]$, the function $T(x) = \gamma^2 x$ satisfies Hyp (see also the discussion
below Theorem 1).

Consider the mean of $Z_n$,

$$Z(t) := \mathbb{E}(Z_n(t)) = \mathbb{E}\big(f(X)\, 1_{X \leq t}\big), \tag{3}$$

(this can be shown to be a cadlag process when $\mathbb{E}(|f(X)|) < +\infty$) and

$$Y_n(t) = \sqrt{n}\,[Z_n(t) - Z(t)]. \tag{4}$$
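
The objects defined in (1), (3) and (4) can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that $X$ is uniform on $[0,1]$ and that $f(x) = x$, so that $Z(t) = t^2/2$; the helper names `make_z_n`, `z_mean` and `y_n` are hypothetical and not taken from the paper. The sketch evaluates $Z_n$ through the sorted sample, that is, as a partial sum stopped at $N_n(t)$.

```python
import bisect
import math
import random


def make_z_n(sample, f):
    """Return t -> Z_n(t), computed as (1/n) * sum_{i <= N_n(t)} f(x_(i)) on the sorted sample."""
    n = len(sample)
    ordered = sorted(sample)
    # prefix[k] = f(x_(1)) + ... + f(x_(k)), with prefix[0] = 0
    prefix = [0.0]
    for x in ordered:
        prefix.append(prefix[-1] + f(x))

    def z_n(t):
        count = bisect.bisect_right(ordered, t)  # N_n(t) = #{i : X_i <= t}
        return prefix[count] / n

    return z_n


def f(x):
    return x  # illustrative choice of f (an assumption, not from the paper)


def z_mean(t):
    return t * t / 2.0  # Z(t) = E(f(X) 1_{X <= t}) for X uniform on [0,1] and f(x) = x


random.seed(0)
n = 1000
sample = [random.random() for _ in range(n)]
z_n = make_z_n(sample, f)


def y_n(t):
    return math.sqrt(n) * (z_n(t) - z_mean(t))  # definition (4)


print([round(y_n(t), 3) for t in (0.25, 0.5, 0.75, 1.0)])
```

Sorting once and storing prefix sums makes each evaluation of $Z_n(t)$ a single binary search, which is convenient when the whole trajectory of $Y_n$ is needed.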

The aim of this paper is to show the following result : 

Theorem 1. Let $(X_i, i \geq 1)$ be a sequence of i.i.d. r.v. taking their values in $[0,1]$ and $f :
[0,1] \to \mathbb{R}$ a measurable function satisfying Hyp, then

$$Y_n \xrightarrow[n]{(d)} Y$$

in $D[0,1]$, the space of cadlag functions on $[0,1]$ equipped with the Skorokhod topology, where
$(Y_t, t \in [0,1])$ is a centered Gaussian process with variance function

$$\mathrm{Var}(Y_s) = F(s)\,\mathrm{Var}(f(X) \mid X \leq s) + F(s)(1 - F(s))\,\mathbb{E}(f(X) \mid X \leq s)^2 \tag{5}$$

and with covariance function, for $0 \leq s < t \leq 1$,

$$\mathrm{Cov}(Y_s, Y_t - Y_s) = -F(s)(F(t) - F(s))\,\mathbb{E}(f(X) \mid X \leq s)\,\mathbb{E}(f(X) \mid s < X \leq t). \tag{6}$$
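
As a sanity check on (5) and (6), the following Python sketch compares them with the covariance structure of a single summand $f(X)1_{X \leq s}$, which is what the covariance of $Y_n$ equals for every $n$ since $Y_n$ is a normalised sum of i.i.d. terms. The choices $X$ uniform on $[0,1]$ and $f(x) = x$ are assumptions made only for this illustration, and all function names are hypothetical. Exact rational arithmetic is used so that the comparison is an identity rather than a numerical approximation.

```python
from fractions import Fraction


# Assumptions for this illustration: X uniform on [0,1] (so F(s) = s) and f(x) = x.
def F(s):
    return s


def cond_mean(a, b):
    return (a + b) / 2  # E(X | a < X <= b) for the uniform law


def cond_var(a, b):
    return (b - a) ** 2 / 12  # Var(X | a < X <= b) for the uniform law


def var_formula_5(s):
    return F(s) * cond_var(0, s) + F(s) * (1 - F(s)) * cond_mean(0, s) ** 2


def cov_formula_6(s, t):
    return -F(s) * (F(t) - F(s)) * cond_mean(0, s) * cond_mean(s, t)


def var_direct(s):
    # Var(X 1_{X<=s}) = E(X^2 1_{X<=s}) - E(X 1_{X<=s})^2 = s^3/3 - (s^2/2)^2
    return s ** 3 / 3 - (s ** 2 / 2) ** 2


def cov_direct(s, t):
    # The two indicators have disjoint supports, so only the product of means remains.
    return -(s ** 2 / 2) * ((t ** 2 - s ** 2) / 2)


s, t = Fraction(1, 3), Fraction(3, 4)
print(var_formula_5(s) == var_direct(s), cov_formula_6(s, t) == cov_direct(s, t))
```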
We briefly discuss the conditions in the theorem. Assume that the $X_i$'s are i.i.d. uniform
on $[0,1]$, and that $f(x) = 1/x^\alpha$ for some $\alpha > 0$. The r.v. $f(X) = 1/X^\alpha$ possesses a variance
iff $\alpha < 1/2$, and it is in the domain of attraction of the normal distribution only in this
case (Theorem 1 needs this hypothesis for the convergence of $Y_n(1)$). The largest $\mathrm{Var}(f(X) \mid X \in
(a, a + \varepsilon))$ is obtained for $a = 0$, in which case we get

$$\mathrm{Var}(f(X) \mid X \in (0, \varepsilon]) = \frac{\varepsilon^{-2\alpha}\,\alpha^2}{(1 - 2\alpha)(1 - \alpha)^2},$$

and one can check that $\alpha < 1/2$ is also the condition for the existence of a function $T$ satisfying
Hyp. Hyp appears to be a minimal assumption in that sense.
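
The displayed conditional variance can be cross-checked against the first two conditional moments, as in the Python sketch below. The function `candidate_T`, defined as $x \cdot \mathrm{Var}(f(X) \mid X \in (0,x])$, is only a hypothetical candidate for the function $T$ of Hyp in this example and is not a construction taken from the paper; it is proportional to $x^{1-2\alpha}$, which makes the role of the condition $\alpha < 1/2$ visible.

```python
import math


def cond_var_moments(alpha, eps):
    """Var(X^{-alpha} | X in (0, eps]) for X uniform on [0,1], from the two conditional moments."""
    m1 = eps ** (-alpha) / (1 - alpha)          # E(X^{-alpha} | X <= eps)
    m2 = eps ** (-2 * alpha) / (1 - 2 * alpha)  # E(X^{-2 alpha} | X <= eps)
    return m2 - m1 ** 2


def cond_var_displayed(alpha, eps):
    """The closed form displayed in the text."""
    return eps ** (-2 * alpha) * alpha ** 2 / ((1 - 2 * alpha) * (1 - alpha) ** 2)


def candidate_T(alpha, x):
    """Hypothetical candidate T(x) = x * Var(f(X) | X in (0, x]), proportional to x^(1 - 2 alpha)."""
    return x * cond_var_displayed(alpha, x)


alpha = 0.25
print(cond_var_moments(alpha, 0.1), cond_var_displayed(alpha, 0.1))
for x in (1e-3, 1e-6, 1e-12):
    # T(x) * ln(x) should shrink towards 0 as x decreases when alpha < 1/2
    print(x, candidate_T(alpha, x) * math.log(x))
```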

The first result concerning the convergence of empirical processes is Donsker's Theorem [2].
It says that when $f$ is constant equal to 1, then $Y_n$ converges in $D[0,1]$ to the standard
Brownian bridge $b$ up to a time change. A kind of miracle arises then, since the same analysis
works for all distributions $\mu$ by a simple time change. This is not the case here.

Apart from strong convergence theorems à la Komlós-Major-Tusnády [4], modern results
about the convergence of empirical processes (see Shorack & Wellner [7] and van der Vaart &
Wellner [8]) rely heavily on the concept of Donsker classes, which we discuss below.

Denote by $\mathbb{P}_n = \frac{1}{n}\sum_{k=1}^{n} \delta_{X_k}$ the empirical measure associated with the sample $(X_i, 1 \leq i \leq
n)$. As a measure, $\mathbb{P}_n$ operates on any set $\mathcal{F}$ of measurable functions $\phi : [0,1] \to \mathbb{R}$,

$$\mathbb{P}_n \phi = \int \phi(x)\, d\mathbb{P}_n(x) = \sum_{k=1}^{n} \phi(X_k)/n.$$

The empirical process is the signed measure $\mathbb{G}_n := \sqrt{n}(\mathbb{P}_n - \mu)$. By the standard central limit
theorem, for a given function $\phi$ (such that $\mu\phi^2 < +\infty$), $\mathbb{G}_n \phi \xrightarrow[n]{(d)} \mathcal{N}(0, \mu(\phi - \mu\phi)^2)$, where
$\mathcal{N}(m, \sigma^2)$ designates the normal distribution with mean $m$ and variance $\sigma^2$.

A P-Donsker class is a set of measurable functions $\mathcal{F}$ such that $(\mathbb{G}_n\phi, \phi \in \mathcal{F})$ converges in
distribution to $(\mathbb{G}\phi, \phi \in \mathcal{F})$, in the $L^\infty$ topology (it is a central limit theorem for a process indexed
by a set of functions). This means that:

• the convergence of the finite dimensional distributions holds: for any $k$ and any
$\phi_1, \dots, \phi_k \in \mathcal{F}$, $(\mathbb{G}_n\phi_1, \dots, \mathbb{G}_n\phi_k) \xrightarrow[n]{(d)} N := (N_1, \dots, N_k)$, where $N$ is a centered Gaussian
vector with covariance matrix $\mathrm{Cov}(N_i, N_j) = \mu[(\phi_i - \mu\phi_i)(\phi_j - \mu\phi_j)]$;

• the sequence $(\mathbb{G}_n\phi, \phi \in \mathcal{F})$ is tight in $L^\infty$.

The proof that a set forms a Donsker class is usually not that simple, and numerous criteria can
be found in the literature. In our case, the set of functions $\mathcal{F}$ is the following one:

$$\mathcal{F}_f = \{(x \mapsto \phi_t(x) = f(x)\, 1_{x \leq t}),\ t \in [0,1]\}.$$

We were unable to find such a criterion for this class, but notice that if such a result existed,
it would imply Theorem 1 only for the topology $L^\infty$, a topology which is weaker than ours. Of
course, Theorem 1 implies that $\mathcal{F}_f$ forms a Donsker class.

Note. In fact classes $\mathcal{F}_f$ for non decreasing $f$, or for functions $f$ whose level sets are given by
two intervals at most (such as $x \mapsto x^2$, $x \mapsto \cos(2\pi x)$, $x \mapsto \sin(2\pi x)$) are Donsker, since they
are VC subgraph classes (see Vapnik & Chervonenkis [9]).¹

If we consider the variables $X_i$ in formula (1) as random times, then $Z_n(t)$ corresponds
(up to the normalisation) to the sum of $f(X_i)$ for all events $X_i$ appearing before time $t$, where $f$
is some cost function. The process $Y_n$ appears to be the suitable tool to measure the fluctuations
of $Z_n$.

We would like to mention [5], a work at the origin of the present paper, written by the same 
authors. In [5], the convergence of rescaled trajectories made with sorted increments (in C) to a 
deterministic convex is shown. For this purpose a weaker version of Theorem 1 is established. 

We provide a proof of our theorem in an old fashioned style. We prove the convergence of 
the finite dimensional distributions, and then establish the tightness in D[0, 1]; even if the proof 
is a bit technical, we think that several tricks make it interesting in its own right. 

2 Proof of Theorem 1 

The proof starts with the convergence of the finite dimensional distributions (FDD)
of $Y_n$: this is classical as we will see. Let $\theta_0 := 0 < \theta_1 < \theta_2 < \dots < \theta_K = 1$ for some
$K \geq 1$ be fixed. In the sequel, for any function (random or not) $L$ indexed by $\theta$, $\Delta L(\theta_j)$ will
stand for $L(\theta_j) - L(\theta_{j-1})$. For any $\ell \leq K$,

$$\Delta Y_n(\theta_\ell) = \sqrt{n}\,\big[\Delta Z_n(N_n(\theta_\ell)) - \Delta Z(\theta_\ell)\big] \tag{7}$$

where by convention $Z_n(N_n(\theta_{-1})) = Z(\theta_{-1}) = 0$. The convergence of the FDD of $Y_n$ follows from the
convergence in distribution of the increments $(\Delta Y_n(\theta_\ell), 0 \leq \ell \leq K)$. Notice that

[...] (8)

If for some $j$, $\theta_{j-1}$ and $\theta_j$ are chosen in such a way that $\Delta F(\theta_j) = 0$ then the $j$th increment in
(7) is almost surely 0 (this is the case for the 0th increment if $\mu(\{0\}) = 0$). We now discuss the
asymptotic behaviour of the other increments: let $J = \{j \in \{0, \dots, K\} : \Delta F(\theta_j) \neq 0\}$.

Let $(n_j, j \in J)$ be some fixed integers summing to $n$. Denote by $\mu_{\theta_{j-1},\theta_j}$ the law of $X$
conditioned by $\{\theta_{j-1} < X \leq \theta_j\}$. Conditional on $(N_n(\theta_j) = n_j, j \in J)$, the variables $\Delta Z_n(N_n(\theta_j))$,
$j \in J$ are independent, and $\Delta Z_n(N_n(\theta_j))$ is a sum of $n_j - n_{j-1}$ i.i.d. copies of variables under
$\mu_{\theta_{j-1},\theta_j}$, denoted from now on $(X_{\theta_{j-1},\theta_j}(k), k \geq 1)$.

¹ We thank Emmanuel Rio for this information.
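Since $(\Delta N_n(\theta_j), j \in J) \sim \mathrm{Multinomial}(n, (\Delta F(\theta_j), j \in J))$,

$$\left(\frac{\Delta N_n(\theta_j) - n\,\Delta F(\theta_j)}{\sqrt{n}},\ j \in J\right) \xrightarrow[n]{(d)} (G_j, j \in J) \tag{9}$$

where $(G_j, j \in J)$ is a centered Gaussian vector with covariance function

$$\mathrm{cov}(G_k, G_\ell) = -\Delta F(\theta_k)\,\Delta F(\theta_\ell) + 1_{k=\ell}\,\Delta F(\theta_k),$$

a formula valid for any $0 \leq k, \ell \leq K$. Putting together the previous considerations, we have

$$\Delta Y_n(\theta_j) = \sum_{m=1}^{\Delta N_n(\theta_j)} \frac{f(X_{\theta_{j-1},\theta_j}(m)) - \mathbb{E}(f(X_{\theta_{j-1},\theta_j}))}{\sqrt{n}} \tag{10}$$

$$\qquad\qquad + \frac{\Delta N_n(\theta_j) - n\,\Delta F(\theta_j)}{\sqrt{n}}\,\mathbb{E}(f(X_{\theta_{j-1},\theta_j})). \tag{11}$$

Using (9) and the central limit theorem, we then get that

$$(\Delta Y_n(\theta_j), 0 \leq j \leq K) \xrightarrow[n]{(d)} \Big(\sqrt{\Delta F(\theta_j)}\,\tilde{G}_j + G_j\,\mathbb{E}(f(X_{\theta_{j-1},\theta_j})),\ 0 \leq j \leq K\Big) \tag{12}$$

where the families of r.v. $(G_j, j \leq K)$ and $(\tilde{G}_j, j \leq K)$ are independent, and the r.v. $\tilde{G}_j$ are
independent centered Gaussian r.v. with variance $\mathrm{Var}(f(X_{\theta_{j-1},\theta_j}))$ (this allows one to determine
the variance and covariance (5) and (6)). Notice that here only the finiteness of $\mathrm{Var}(f(X_{\theta_{j-1},\theta_j}))$
and $\mathbb{E}(f(X_{\theta_{j-1},\theta_j}))$ is used.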
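
The multinomial central limit theorem (9) can be illustrated by simulation. The Python sketch below estimates the covariance matrix of the normalised increments $(\Delta N_n(\theta_j) - n\Delta F(\theta_j))/\sqrt{n}$ and compares it with $-\Delta F(\theta_k)\Delta F(\theta_\ell) + 1_{k=\ell}\Delta F(\theta_k)$. It assumes, for illustration only, that $X$ is uniform on $[0,1]$ and uses the arbitrary grid $(0, 0.2, 0.5, 1)$; the helper names are hypothetical.

```python
import bisect
import math
import random


def increment_counts(sample, thetas):
    """Number of sample points in (thetas[j-1], thetas[j]] for j = 1, ..., K."""
    counts = [0] * (len(thetas) - 1)
    for x in sample:
        j = bisect.bisect_left(thetas, x)  # thetas[j-1] < x <= thetas[j]
        if j >= 1:
            counts[j - 1] += 1
    return counts


def empirical_cov(rows):
    """Unbiased sample covariance matrix of a list of equally long vectors."""
    reps, dim = len(rows), len(rows[0])
    means = [sum(r[k] for r in rows) / reps for k in range(dim)]
    return [[sum((r[k] - means[k]) * (r[l] - means[l]) for r in rows) / (reps - 1)
             for l in range(dim)] for k in range(dim)]


random.seed(1)
thetas = [0.0, 0.2, 0.5, 1.0]
probs = [0.2, 0.3, 0.5]  # Delta F(theta_j) for the uniform law (illustrative assumption)
n, reps = 100, 3000
rows = []
for _ in range(reps):
    counts = increment_counts([random.random() for _ in range(n)], thetas)
    rows.append([(c - n * p) / math.sqrt(n) for c, p in zip(counts, probs)])

estimated = empirical_cov(rows)
limit = [[(probs[k] if k == l else 0.0) - probs[k] * probs[l] for l in range(3)]
         for k in range(3)]
max_gap = max(abs(estimated[k][l] - limit[k][l]) for k in range(3) for l in range(3))
print(max_gap)
```

For a multinomial vector the covariance of the normalised increments equals the limiting one for every $n$, so the remaining gap here is Monte Carlo error only.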

It remains to show the tightness of the sequence $(Y_n, n \geq 1)$ in $D[0,1]$. A criterion for the
tightness in $D[0,1]$ can be found in Billingsley [1, Thm. 13.2]: a sequence of processes $(Y_n, n \geq 1)$
with values in $D[0,1]$ is tight if, for any $\varepsilon \in (0,1)$,

$$\lim_{\delta \to 0} \limsup_n \mathbb{P}(\omega'(Y_n, \delta) \geq \varepsilon) = 0$$

where $\omega'(f, \delta) = \inf_{(t_i)} \max_i \sup_{s,t \in [t_{i-1}, t_i)} |f(s) - f(t)|$, and the partitions $(t_i)$ range over all
partitions of the form $0 = t_0 < t_1 < \dots < t_n \leq 1$ with $\min\{t_i - t_{i-1}, 1 \leq i \leq n\} > \delta$.

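We now compare our current model formed by a set $\{X_1, \dots, X_n\}$ of $n$ i.i.d. copies of $X$,
whose law is denoted from now on by $\mathbb{P}_n$, with a Poisson point process $P_n$ on $[0,1]$ with intensity $n\mu$, whose law is denoted
by $\mathbb{P}_{P_n}$. Conditionally on $\#P_n = k$, the $k$ points $P_n := \{X'_1, \dots, X'_k\}$ are i.i.d. and have
distribution $\mu$, and then $\mathbb{P}_{P_n}(\,\cdot \mid \#P_n = n) = \mathbb{P}_n$. The Poisson point process is naturally equipped
with a filtration $\sigma := \{\sigma_t = \sigma(\{P_n \cap [0,t]\}), t \in [0,1]\}$.

We are here working under $\mathbb{P}_{P_n}$, and we let $N(\theta) = \#(P_n \cap [0, \theta])$; notice that under $\mathbb{P}_n$, $N$
and $N_n$ coincide.

Before starting, recall that if $N \sim \mathrm{Poisson}(b)$, for any positive $\lambda$,

$$\mathbb{P}(N \geq x) = \mathbb{P}(e^{\lambda N} \geq e^{\lambda x}) \leq \mathbb{E}(e^{\lambda N - \lambda x}) = e^{-b + b e^{\lambda} - \lambda x}, \tag{13}$$
$$\mathbb{P}(N \leq x) = \mathbb{P}(e^{-\lambda N} \geq e^{-\lambda x}) \leq \mathbb{E}(e^{-\lambda N + \lambda x}) = e^{-b + b e^{-\lambda} + \lambda x}. \tag{14}$$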
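
The two exponential bounds (13) and (14) can be compared with exact Poisson tails, as in the following Python sketch. The parameter values are arbitrary illustrations. The choices $\lambda = \ln(x/b)$ in (13) and $\lambda = \ln(b/x)$ in (14), which minimise the respective exponents, are a standard optimisation and are not the choice of $\lambda$ made later in the proof.

```python
import math


def poisson_pmf(k, b):
    """P(N = k) for N ~ Poisson(b), computed on the log scale for stability."""
    return math.exp(-b + k * math.log(b) - math.lgamma(k + 1))


def upper_tail(x, b):
    """Exact P(N >= x) for an integer x."""
    return 1.0 - sum(poisson_pmf(k, b) for k in range(x))


def lower_tail(x, b):
    """Exact P(N <= x) for an integer x."""
    return sum(poisson_pmf(k, b) for k in range(x + 1))


def bound_13(x, b, lam):
    return math.exp(-b + b * math.exp(lam) - lam * x)


def bound_14(x, b, lam):
    return math.exp(-b + b * math.exp(-lam) + lam * x)


b = 100.0
for lam in (0.1, math.log(130 / b), 0.5):
    print("upper", lam, upper_tail(130, b), bound_13(130, b, lam))
for lam in (0.1, math.log(b / 70), 0.5):
    print("lower", lam, lower_tail(70, b), bound_14(70, b, lam))
```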
We explain now why the tightness of $(Y_n, n \geq 1)$ under $\mathbb{P}_{P_n}$ implies the same result under $\mathbb{P}_n$.
Let $m = \inf\{x \in [0,1], F(x) \geq 1/2\}$ be the median of $\mu$.

Lemma 2.1. There exists a constant $\gamma$ (which depends on $\mu$) such that for any $\sigma_m$-measurable
event $A$,

$$\mathbb{P}_n(A) = \mathbb{P}_{P_n}(A \mid \#P_n = n) \leq \gamma\, \mathbb{P}_{P_n}(A). \tag{15}$$

Proof of the Lemma. We have

$$\mathbb{P}_{P_n}(A \mid \#P_n = n) = \sum_k \frac{\mathbb{P}_{P_n}\big(A, \#(P_n \cap [0,m]) = k\big)\, \mathbb{P}\big(\#(P_n \cap [m,1]) = n - k\big)}{\mathbb{P}(\#P_n = n)} \leq \gamma\, \mathbb{P}_{P_n}(A)$$

where $\gamma = \sup_{n \geq 1} \sup_k \mathbb{P}(\#(P_n \cap [m,1]) = n - k)/\mathbb{P}(\#P_n = n)$, which is indeed finite since $\mathbb{P}(\#P_n = n) \sim (2\pi n)^{-1/2}$,
and since $\#(P_n \cap [m,1]) \sim \mathrm{Poisson}(n/2)$, the probability that its value is $k$ is bounded
above by some $d/\sqrt{n}$ according to Petrov [6, Thm. 7 p. 48]. □
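
The finiteness of the constant in Lemma 2.1 can be explored numerically. The Python sketch below takes the count on $[m,1]$ to be $\mathrm{Poisson}(n/2)$, as in the proof, and computes, for each $n$, the ratio between the largest point probability of that count and $\mathbb{P}(\#P_n = n)$. The function names are hypothetical, and the range of $n$ is an arbitrary illustration that says nothing rigorous about the supremum over all $n$.

```python
import math


def poisson_pmf(k, b):
    """P(N = k) for N ~ Poisson(b), computed on the log scale for stability."""
    return math.exp(-b + k * math.log(b) - math.lgamma(k + 1))


def ratio(n):
    """max_k P(Poisson(n/2) = k) / P(Poisson(n) = n)."""
    b = n / 2.0
    mode = math.floor(b)  # a Poisson(b) point probability is maximal at floor(b)
    return poisson_pmf(mode, b) / poisson_pmf(n, float(n))


ratios = [ratio(n) for n in range(1, 2001)]
print(max(ratios), ratios[-1], math.sqrt(2))
```

By Stirling's formula both point probabilities are of order $n^{-1/2}$, which is why the ratio stabilises (near $\sqrt{2}$ under the stated assumption) instead of growing with $n$.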

Thanks to Lemma 2.1, if the sequence of restrictions $(Y_n|_{[0,m]}, n \geq 1)$ of $Y_n$ on $[0,m]$ is tight
on $D[0,m]$ under $\mathbb{P}_{P_n}$ then so it is under $\mathbb{P}_n$ (the same proof works on $D[m,1]$ by a time reversal
argument). To end the proof, we show that $(Y_n|_{[0,m]}, n \geq 1)$ is indeed tight under $\mathbb{P}_{P_n}$.

Take then some (small) $\eta \in (0,1)$, $\varepsilon > 0$; we will show that one can find a finite partition
$(t_i, i \in I)$ of $[0,m]$ and a $\delta \in (0,m)$ such that

$$\limsup_n \mathbb{P}_n(\omega'(Y_n, \delta) \geq \varepsilon) \leq \eta, \tag{16}$$

which is sufficient for our purpose.

We decompose the process $Y_n$ as suggested by (10) and (11),

$$Y_n(\theta) = Y'_n(\theta) + Y''_n(\theta) \tag{17}$$

where

[...]

(If $F(\theta) = 0$ then set $Y''_n(\theta) = 0$ instead of (19).)

The tightness of each of the sequences $(Y'_n, n \geq 1)$ and $(Y''_n, n \geq 1)$ in $D[0,1]$ suffices to
deduce that of $(Y_n, n \geq 1)$. We then proceed separately.
Tightness of $(Y'_n, n \geq 1)$

To control the jumps of $Y'_n$, we will need to localise the large atoms of $\mu$. Let $A = \{x \in
[0,m], \mu(\{x\}) > 0\}$ be the set of positions of the atoms of $\mu$ in $[0,m]$, and let $A_{\geq a} := \{x \in
A : \mu(\{x\}) \geq a\}$. Clearly $\#A_{\geq a} \leq 1/a$ and $[0,m] \setminus A_{\geq a}$ forms a finite union of open connected
intervals $(O_x, x \in G)$, with extremities $(t'_i, i \in I)$. The intervals $(O_x, x \in G)$ can be further cut
as follows:

- do nothing to those such that $\mu(O_x) < 2a$,

- those such that $\mu(O_x) \geq 2a$ are further split. Since they contain no atom with mass $\geq a$, they
can be split into smaller intervals having all their weights in $[a, 2a]$ except for at most one (in
each interval $O_x$), which may have a weight smaller than $a$.

Once all these splittings have been done, a list of at most $3/a$ intervals is obtained (in fact fewer
than that), all of them having a weight smaller than $2a$. Name $G_a = (O_x, x \in I_a)$ the collection
of obtained open intervals, indexed by some set $I_a$, and by $(t_i, i \geq 0)$ the partitions obtained.
Take $O$ one of these intervals. One has that $\#(P_n \cap O)$ is Poisson with parameter $n\mu(O) \leq na$.
Consider again (10), (11) and Hyp. Set, for any $L \geq 1$,

[...]

Let

$$\omega(Y'_n, O) = \sup\{|Y'_n(s) - Y'_n(t)|,\ s, t \in O\}$$

be the modulus of continuity of $Y'_n$ on $O$. We have, for any $\alpha \in (0, 1/2)$,

$$\mathbb{P}(\omega(Y'_n, O) \geq x) \leq \mathbb{P}\big(|\#(P_n \cap O) - n\mu(O)| \geq n^{1/2+\alpha}\big) + [\dots] \tag{20}$$

where

$$\Gamma_n := \big[\,n\mu(O) - n^{1/2+\alpha},\ n\mu(O) + n^{1/2+\alpha}\,\big].$$

Using (13) and (14), one sees that

[...]

for some $c > 0$, $d > 0$ and $n$ large enough (for this take $x = n\mu(O) + n^{1/2+\alpha}$, $\lambda = 1/\sqrt{n}$ in (13)
and $x = n\mu(O) - n^{1/2+\alpha}$, $\lambda = 1/\sqrt{n}$ in (14)).

Let us take care of the second term in (20). Clearly,

$$\sup\{|S_i^{(n)} - S_j^{(n)}|,\ i, j \leq L\} = \max_{i \leq L} S_i^{(n)} - \min_{i \leq L} S_i^{(n)}.$$

According to Petrov [6, Thm. 12 p. 50],

[...]
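and then

$$\mathbb{P}\Big(\max_{i \leq L} S_i^{(n)} \geq x\Big) \leq 2\,\mathbb{P}\big(S_L^{(n)} \geq x - C_n(O)\big)$$

for $C_n(O) = \sqrt{2L\,\mathrm{Var}(f(X_O))/n}$ (here $X_O$ is distributed as $X$ conditioned on $\{X \in O\}$), and a similar
inequality holds for $\min_{j \leq L} S_j^{(n)}$. We then have

$$\mathbb{P}\Big(\max_{i \leq L} S_i^{(n)} - \min_{j \leq L} S_j^{(n)} \geq x\Big) \leq \mathbb{P}\Big(\max_{i \leq L} S_i^{(n)} \geq x/2\Big) + \mathbb{P}\Big(-\min_{j \leq L} S_j^{(n)} \geq x/2\Big)$$

$$\leq 2\,\mathbb{P}\Big(S_L^{(n)} \geq \frac{x}{2} - C_n(O)\Big) + 2\,\mathbb{P}\Big(S_L^{(n)} \leq -\frac{x}{2} + C_n(O)\Big).$$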



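To get some bounds, we use the central limit theorem for $S_L^{(n)}$ and take $x = \varepsilon$, $a > 0$ such that
$T(a) = \varepsilon^2\delta^2$ for some small $\delta > 0$ (recall that $T$ is increasing and therefore invertible), and any
sequence $L_n$ such that $L_n/n \to \mu(O)$ (any sequence $L = L_n$ such that $L_n \in \Gamma_n$ satisfies this, and
then we can control the supremum with this method). We have

$$\mathbb{P}\Big(S_L^{(n)} \geq \frac{\varepsilon}{2} - C_n(O)\Big) = \mathbb{P}\left(\frac{S_L^{(n)}}{\sqrt{\mu(O)\,\mathrm{Var}(f(X_O))}} \geq \frac{\varepsilon/2 - C_n(O)}{\sqrt{\mu(O)\,\mathrm{Var}(f(X_O))}}\right).$$

For $n$ large enough,

$$C_n(O) \leq \sqrt{4T(\mu(O))} \leq 2\varepsilon\delta$$

and therefore

$$\limsup_n \mathbb{P}\Big(S_L^{(n)} \geq \frac{\varepsilon}{2} - C_n(O)\Big) \leq \bar{\Phi}\left(\frac{\varepsilon/2 - 2\varepsilon\delta}{\sqrt{\mu(O)\,\mathrm{Var}(f(X_O))}}\right)$$

where $\bar{\Phi}$ is the tail function of the standard Gaussian distribution.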

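Finally, if $\delta$ is chosen sufficiently small ($2\delta < 1/2$), since $\mu(O)\,\mathrm{Var}(f(X_O)) \leq T(\mu(O)) \leq
T(a) = \varepsilon^2\delta^2$, then on each interval $O \in G_a$,

$$\mathbb{P}\Big(\sup\{|S_i^{(n)} - S_j^{(n)}|,\ i, j \leq L\} \geq \varepsilon\Big) \leq 4\,\bar{\Phi}\Big(\frac{1/2 - 2\delta}{\delta}\Big)$$

and this independently of the choice of the interval $O$ in $G_a$, for $n$ large enough.

The control of all the intervals together can be achieved using the union bound: since there
are at most $3/T^{-1}(\varepsilon^2\delta^2)$ such intervals,

$$\mathbb{P}_{P_n}\Big(\max_{O \in G_a} \omega(Y'_n, O) \geq \varepsilon\Big) \leq \frac{3}{T^{-1}(\varepsilon^2\delta^2)} \Big(4\,\bar{\Phi}\Big(\frac{1/2 - 2\delta}{\delta}\Big) + [\dots]\Big).$$

Since $\bar{\Phi}(x) \sim \exp(-x^2/2)/(\sqrt{2\pi}\,x)$ and $T(x)\ln(x) \to 0$, for any $\varepsilon > 0$
and $\gamma > 0$ there exists a $\delta$ sufficiently small such that

$$T(e^{-\gamma/\delta^2}) \leq \varepsilon^2\delta^2 \quad \text{or equivalently} \quad \frac{1}{T^{-1}(\varepsilon^2\delta^2)} \leq e^{\gamma/\delta^2},$$

and as a result the probability can be taken as small as wanted. □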
Tightness of $(Y''_n, n \geq 1)$

Recall (19). We work here under $\mathbb{P}_n$ and we only consider the interval $I = \{\theta : F(\theta) > 0\}$
since $Y''_n(\theta)$ equals 0 on its complement. Since on $I$, $\theta \mapsto \mathbb{E}(f(X) \mid X \leq \theta)$ is cadlag (and does not depend
on $n$), it suffices to see why $\big((N_n(\theta) - nF(\theta))/\sqrt{n},\ n \geq 1\big)$ is tight in $D[0,1]$, but this is clear since this is
a consequence of the convergence of the standard empirical process (Donsker [2]). □